Text Detection and Translation from Natural Scenes

Authors

  • Jiang Gao
  • Jie Yang
  • Ying Zhang
  • Alex Waibel
Abstract

We present a system for automatic extraction and interpretation of signs from a natural scene. The system is capable of capturing images, detecting and recognizing signs, and translating them into a target language. The translation can be displayed on a hand-held wearable display or a head-mounted display. It can also be synthesized as a voice output message over earphones. We address challenges in automatic sign extraction and translation. We describe methods for automatic sign extraction. We extend example-based machine translation technology for sign translation. We use a user-centered approach in the system development. The approach takes advantage of human intelligence when needed and leverages human capabilities. We are currently working on Chinese sign translation. We have developed a prototype system that can recognize Chinese signs input from a video camera, a common gadget for tourists, and translate the signs into either English text or a voice stream. We have built a database containing about 800 Chinese signs for development and evaluation. We present evaluation results and analyze errors. Sign translation, in conjunction with spoken language translation, can help international tourists overcome language barriers. The technology can also help a visually handicapped person increase environmental awareness.

Similar resources

Segmentation Framework for Multi-Oriented Text Detection and Recognition

This paper implements a new and efficient technique for text detection in natural scenes. The proposed methodology is based on Otsu’s segmentation method, which segments the higher-intensity text from natural scenes. Although various text detection techniques have been implemented, the proposed methodology for text detection provides hig...

Translation and Hybridity in Scenes and Frames Semantics

The present study is a theoretical attempt to illustrate how Fillmore's Scenes and Frames Semantics (SFS) could be employed as a framework to portray the process of understanding and translating hybrid texts. It first reviews the origin of SFS; then it maps SFS onto Nida’s linguistic model of translation process and the Interpretive Theory of Translation; it examines in the next section, withi...

English-Persian Plagiarism Detection based on a Semantic Approach

Plagiarism, which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them”, poses a major challenge to knowledge spread publication. Plagiarism has been placed in four categories: direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...

Natural scene text localization using edge color signature

Localizing text regions in images taken from natural scenes is a challenging problem due to variations in the font, size, color and orientation of text. In this paper, we introduce a new concept called Edge Color Signature for localizing text regions in an image. This method is able to localize both Farsi and English texts. In the proposed method, first a pyramid using diff...

A Morphological Image Preprocessing Suite for OCR on Natural Scene Images

As demand grows for mobile applications, research in optical character recognition (OCR), a technology well-developed for document imaging, is shifting focus to the recognition of text embedded in digital photographs or video. Segmenting text and background in natural scenes is a difficult classification problem, and the accuracy of this segmentation is of utmost importance when the output of a...

Automatic Detection of Signs with Affine Transformation

In this paper, we propose an approach for detecting signs from natural scenes. The approach efficiently embeds multi-resolution, adaptive search, and affine rectification algorithms in a hierarchical framework, with different emphases at each layer. We combine multi-resolution and multi-scale edge detection techniques to effectively detect text in different sizes. Different from the existing ap...

Publication date: 2001